Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Correcting Different Types of Errors in Texts

Identifieur interne : 000393 ( Main/Exploration ); précédent : 000392; suivant : 000394

Correcting Different Types of Errors in Texts

Auteurs : Aminul Islam [Canada] ; Diana Inkpen [Canada]

Source :

RBID : ISTEX:A5985BE5EDF2A8F996A278699D22B8E28B3D8736

Abstract

Abstract: This paper proposes an unsupervised approach that automatically detects and corrects a text containing multiple errors of both syntactic and semantic nature. The number of errors that can be corrected is equal to the number of correct words in the text. Error types include, but are not limited to: spelling errors, real-word spelling errors, typographical errors, unwanted words, missing words, prepositional errors, punctuation errors, and many of the grammatical errors (e.g., errors in agreement and verb formation).

Url:
DOI: 10.1007/978-3-642-21043-3_23


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct:series">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Correcting Different Types of Errors in Texts</title>
<author>
<name sortKey="Islam, Aminul" sort="Islam, Aminul" uniqKey="Islam A" first="Aminul" last="Islam">Aminul Islam</name>
</author>
<author>
<name sortKey="Inkpen, Diana" sort="Inkpen, Diana" uniqKey="Inkpen D" first="Diana" last="Inkpen">Diana Inkpen</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:A5985BE5EDF2A8F996A278699D22B8E28B3D8736</idno>
<date when="2011" year="2011">2011</date>
<idno type="doi">10.1007/978-3-642-21043-3_23</idno>
<idno type="url">https://api.istex.fr/document/A5985BE5EDF2A8F996A278699D22B8E28B3D8736/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001671</idno>
<idno type="wicri:Area/Istex/Curation">001577</idno>
<idno type="wicri:Area/Istex/Checkpoint">000049</idno>
<idno type="wicri:doubleKey">0302-9743:2011:Islam A:correcting:different:types</idno>
<idno type="wicri:Area/Main/Merge">000398</idno>
<idno type="wicri:Area/Main/Curation">000393</idno>
<idno type="wicri:Area/Main/Exploration">000393</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Correcting Different Types of Errors in Texts</title>
<author>
<name sortKey="Islam, Aminul" sort="Islam, Aminul" uniqKey="Islam A" first="Aminul" last="Islam">Aminul Islam</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Canada</country>
<wicri:regionArea>University of Ottawa, Ottawa</wicri:regionArea>
<wicri:noRegion>Ottawa</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Canada</country>
</affiliation>
</author>
<author>
<name sortKey="Inkpen, Diana" sort="Inkpen, Diana" uniqKey="Inkpen D" first="Diana" last="Inkpen">Diana Inkpen</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Canada</country>
<wicri:regionArea>University of Ottawa, Ottawa</wicri:regionArea>
<wicri:noRegion>Ottawa</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Canada</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2011</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">A5985BE5EDF2A8F996A278699D22B8E28B3D8736</idno>
<idno type="DOI">10.1007/978-3-642-21043-3_23</idno>
<idno type="ChapterID">23</idno>
<idno type="ChapterID">Chap23</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: This paper proposes an unsupervised approach that automatically detects and corrects a text containing multiple errors of both syntactic and semantic nature. The number of errors that can be corrected is equal to the number of correct words in the text. Error types include, but are not limited to: spelling errors, real-word spelling errors, typographical errors, unwanted words, missing words, prepositional errors, punctuation errors, and many of the grammatical errors (e.g., errors in agreement and verb formation).</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Canada</li>
</country>
</list>
<tree>
<country name="Canada">
<noRegion>
<name sortKey="Islam, Aminul" sort="Islam, Aminul" uniqKey="Islam A" first="Aminul" last="Islam">Aminul Islam</name>
</noRegion>
<name sortKey="Inkpen, Diana" sort="Inkpen, Diana" uniqKey="Inkpen D" first="Diana" last="Inkpen">Diana Inkpen</name>
<name sortKey="Inkpen, Diana" sort="Inkpen, Diana" uniqKey="Inkpen D" first="Diana" last="Inkpen">Diana Inkpen</name>
<name sortKey="Islam, Aminul" sort="Islam, Aminul" uniqKey="Islam A" first="Aminul" last="Islam">Aminul Islam</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000393 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000393 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:A5985BE5EDF2A8F996A278699D22B8E28B3D8736
   |texte=   Correcting Different Types of Errors in Texts
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024